Accuracy of multiple sequence alignments as assessed by reference to structural alignments

Author

  • O. Gotoh
Abstract

In the last 20 years, many multiple-sequence alignment programs based on various principles have been developed. Continuous efforts have been devoted to solving two major problems: (1) how to evaluate the 'goodness' of an alignment, and (2) how to obtain the alignment with the optimal score. These problems are tightly interrelated, and other criteria are needed to objectively assess the reliability of a given alignment method. Recently, the number of protein three-dimensional (3D) structures determined by X-ray crystallography and high-resolution NMR methods has been increasing rapidly. Comparison of the 3D structures makes it possible to align distantly related protein sequences based on their structural equivalence. A few collections of such structure-based alignments are now available [4]. Hence we can assess the quality of sequence alignments obtained by a given method by referring to their structural counterparts. McClure et al. [3] recently reported that the most popular 'progressive' metho...
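The idea of judging a sequence alignment against its structural counterpart can be illustrated with a minimal sketch. The measure below (the fraction of residue pairs aligned in the reference that are also aligned in the test alignment) is an assumption chosen for illustration; the truncated abstract does not state which accuracy measure the paper uses, and the function names `aligned_pairs` and `pair_agreement` are hypothetical.

```python
from itertools import combinations


def aligned_pairs(alignment, gap="-"):
    """Collect every pair of residues that share a column.

    `alignment` maps a sequence name to its gapped row; all rows must have
    the same length. A residue is identified by (name, ungapped index).
    """
    names = sorted(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[name]) != length for name in names):
        raise ValueError("all rows of an alignment must have equal length")

    next_index = {name: 0 for name in names}  # ungapped position per sequence
    pairs = set()
    for col in range(length):
        present = []
        for name in names:
            if alignment[name][col] != gap:
                present.append((name, next_index[name]))
                next_index[name] += 1
        # Names are visited in sorted order, so each pair has one canonical form.
        pairs.update(combinations(present, 2))
    return pairs


def pair_agreement(test, reference):
    """Fraction of reference residue pairs reproduced by the test alignment.

    Illustrative measure only (an assumption, not the paper's definition).
    """
    reference_pairs = aligned_pairs(reference)
    if not reference_pairs:
        return 0.0
    return len(reference_pairs & aligned_pairs(test)) / len(reference_pairs)


# Toy data, invented for the example.
structural = {"a": "ACD-", "b": "A-DE"}
sequence_based = {"a": "ACD-", "b": "-ADE"}
print(pair_agreement(sequence_based, structural))
```

A pair-based measure of this kind compares which residues are matched rather than comparing the gapped rows character by character, so it is insensitive to how the alignment is padded with gap-only columns.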

Similar resources

COFFEE: an objective function for multiple sequence alignments

MOTIVATION In order to increase the accuracy of multiple sequence alignments, we designed a new strategy for optimizing multiple sequence alignments by genetic algorithm. We named it COFFEE (Consistency based Objective Function For alignmEnt Evaluation). The COFFEE score reflects the level of consistency between a multiple sequence alignment and a library containing pairwise alignments of the s...

3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Most bioinformatics analyses require the assembly of a multiple sequence alignment. It has long been suspected that structural information can help to improve the quality of these alignments, yet the effect of combining sequences and structures has not been evaluated systematically. We developed 3DCoffee, a novel method for combining protein sequences and structures in order to generate high-qu...

Profile alignment scoring functions A comparison of scoring functions for protein sequence profile alignment

Motivation: In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence-sequence methods (e.g. BLAST) and profile-sequence methods (e.g. PSI-BLAST). Profile-profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTAL...

MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information

We have developed MUMMALS, a program to construct multiple protein sequence alignment using probabilistic consistency. MUMMALS improves alignment quality by using pairwise alignment hidden Markov models (HMMs) with multiple match states that describe local structural information without exploiting explicit structure predictions. Parameters for such models have been estimated from a large librar...

A comparison of scoring functions for protein sequence profile alignment

MOTIVATION In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence-sequence methods (e.g. BLAST) and profile-sequence methods (e.g. PSI-BLAST). Profile-profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTAL...

Publication date: 1997